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Abstract 


In this paper, an extension of record models, well known as k-records, 
is considered. Bayesian estimation as well as prediction based on k-records 
are presented when the underlying distribution is assumed to have a general 
form. The proposed procedure is applied to the Exponential, Weibull and 
Pareto models in one parameter case. Also, the two-parameter Exponential 
distribution, when both parameters are unknown, is studied in more details. 
Since the ordinary record values are contained in the k-records, by putting 
k = 1, the results for usual records can be obtained as special case. 
Keywords and phrases: Admissibility; Bayes prediction; Bayesian esti- 
mation; conjugate prior; weibull distribution. 
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1 Introduction 


Let {X;, 7 > 1} be a sequence of independent and identically distributed (iid) con- 


tinuous random variables each distributed according to cumulative distribution 
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function (cdf) F(t) and probability density function (pdf) f(t). An observation 
X; will be called an upper record value if its value exceeds that of all previous ob- 
servations. Thus, X,; is an upper record if X; > X; for every i < j. An analogous 
definition can be given for lower record values. Today there are over 500 papers 
and several books published on record-breaking data (see, for instance, Chandler 
[8], Resnick [19], Shorrock [21], Glick [14], Samaniego and Whitaker [20], Arnold 
et al. [5] and Nevzorov [18}]). 

There are several situations where the second or third largest values of special 
interest, insurance claims some non-life insurance can be used as an example, see 
Kamps [16], so the usual record models is inadequate. Also, in the ordinary 
record value theory, while inverse sampling considerations have given valuable 
insights, their practical implementation is greatly hindered by the sparsity of 
records. These problems caused the researchers to study the theory of k-record 
models. Upper k-record process is defined in terms of the k-th largest X yet seen. 
For a formal definition, we consider the definition in Arnold et al. [5], p. 43, in 


the continuous case, let T)(,) = k, Ryn) = X1-~ and for n = 2, let 


Late) = min{j 19> T—1(k) ¥5 > DET igh SRE UTS Sips 


where X;j-m denotes the i-th order statistic in a sample of size m. The sequence 
of upper k-records is then defined by Ry(p) = XT (4) —k+L:T a(x) for n > 1. Arnold 
et al. [5] call this a Type 2 k-record sequence. For k = 1, note that the usual 
records are recovered. An analogous definition can be given for lower k-records as 
well. This sequence of k-records was introduced by Dziubdziela and Kopocinski 
[12] and it has found acceptance in the literature. Some work has been done on 
the statistical inference, based on k-records. See, for instance, Deheuvels and 
Nevzorov [11], Berred [7], Ali Mousa et al. [4], Malinowska and Szynal [17], 
Danielak and Raqab [9],[10], Ahmadi et al. [2], Fashandi and Ahmadi [13] and 
references therein. 

We assume that this type of k-record data is available and the aim of this 


paper is to develop inference methods as well as prediction of future k-records 
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based on past observed k-records. The rest of the paper is organized as follows. 
In Section 2, Bayesian estimation as well as prediction based on k-records are 
presented when the underlying distribution is assumed to have a general model. 
In Section 3, a two-parameter exponential distribution is considered; the maxi- 
mum likelihood and Bayes estimators for the unknown parameters, are obtained. 
Bayesian prediction of the future k-records, either point or interval, are obtained 
in Section 4, when the k-records are assumed to come from the two-parameter 


exponential model. 


2 A General Model 


In this section, we consider the problems of estimation and prediction based on 
k-records, when the underlying distribution has a general form. In order to do 
this, let C' be the class of all absolute continuous distribution functions F’ of the 


form 


Fo(z) =1-—e-), «>0, (2.1) 


where \),(x) (the derivative of Ag(x) w.r.t #) exists and is a positive function of 


§ and x. Then 
fo(z) = Xp(a)e-™, ae > 0. (2:2) 


This class includes several important life time families such as: Exponential, 
Weibull, compound Weibull, Pareto, Beta, Gompertz, compound Gompertz and 
Burr type XII, among others. 

2.1 Estimation 

Using the joint pdf of usual records, we readily have the joint density of the first 


m, k-records Ry(p), Rag), ---, Rm(k) as 


f(@15 052m) = k™ II io F@) — F(am))*, (2.3) 
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(see, Arnold et al. [5]). Now, suppose we observe Rix) = £1,°++ ; Rik) = Lm 


then by substitution of (2.1) and (2.2) in (2.3), the likelihood function L(@) is 
L(0) x A(0;x)e7 BO#m) (2.4) 
where x = (%1,-°-: , 2m), 
A(6;x) = ences and: BO; 4m) = kAglem)- 
i=1 


There is clearly no way in which one can say that one prior is better than any 
other. Presumably one has own subjective prior and must live with all of its lumps 
and bumps. It is more frequently the case that we elect to restrict attention to 
a given flexible family of prior distributions and we choose one from the family 
which seems to the best of our match and personal believes. With this in mind, 
let the conjugate prior density function for 6, proposed by AL-Hussaini [3], is 
given by 

(0; 5) « C(0;5)e"P), GE 0, FEN, (2.5) 


where (Q. is the prior parameter(s) space. Then the posterior density function is 
derived as 


(6x) = C1(M, N)M(0;x, d)e7N(% 29) (2.6) 


where 


M(0;x,6) = C(0@;6)A(6;x), 
N(6;%m,6) = D(6;6)+ B(6,2m), 


and C(M, N) is the normalizing constant given by 


—1 
Ci(M, N) = | i M(0;x,5)e7 NG em, Dag} (2.7) 
(2) 


If © is one dimensional then the Bayes estimator of 0, under squared error (SE) 


loss function, is 
, C\(M, N) 
A290 = — 2.8 
Be Oy( Me NY’ 28) 
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where M*(x) = cM(z). 


Remark. 63 in (2.8) is the unique Bayes estimate of 9 under SE loss function 


with respect to the above mentioned proper prior and hence is admissible. 


Example 2.1 (One-Parameter Exponential Model) 


Let Ag(x) = (x — po)/o where jp is known and 6 = a, i.e. we have a one 
parameter exponential distribution. Then A(@;x) = 1/o” and B(6;%m) = 
k(%m — Uo)/o. It can be shown that the maximum likelihood estimation of o 
is @y = k(Rmk) — Mo)/m. We use Inverted Gamma with parameters a and 
b as the conjugate prior, ie. a(o) = b¢o~ +) exp{—b/o}/I(a), where from 
(2.5), C(0;6) = 0 +2), D(6;5) = b/o and 6 = (a,b). Therefore, M(0;x, 65) = 
1/o™*2*! and N(0; am, 6) = (b+k(am—pUo))/o. From (2.8), the Bayes estimate 


of o under SE loss function is given by 


Be b+ k( Rk) — Ho) 
BS pee) : 
It may be noted that, from (2.6), the posterior distribution of o~! is 1(m+a, b+ 


K(am al Lo))- 


Example 2.2 (Weibull Model) 


Suppose Ag(x) = ax, where 3 is known and 6 = a. Then Xj,(x) = aBx?-!. It 
can be shown that the maximum likelihood estimate of a is Gy = m/ (kR? (K)): 
Assuming a Gamma conjugate prior with parameter a and 8, i.e. 

m(a) = b¢a*~! exp{—ba}/T(a), the Bayes estimate of a under SE loss function 
is given by 


aps = (m+a)/(KRy gy +)- 
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Example 2.3 (Pareto Model) 


In this model, Ag(x) = aln(#/) where 2 is known and 6 = a. So, by (2.4), max- 
imum likelihood estimate of a is dy = m/(kIn[ Rmx) /G]). Assuming a conjugate 
prior Gamma with parameters a and b, i.e. t(a) = b2a%! exp{—ba}/T(a), the 


Bayes estimate of a under SE loss function is given by 


és _ mta 
BS ~ kIn[Rmna/B] +b 


When both of the parameters in the above examples are unknown, In [1] we 


have obtained similar results based on usual records (k = 1). 


2.2 Prediction 


Assume that we have the first m upper k-records Ry(,) = ©1, Roz) = T2,---; Rm(k) = 
Xm from a member of class C in (2.1). Based on such a sample, prediction, ei- 
ther point or interval, is needed for s-th upper k-record, 1 < m < s. Now, let 


Y = Rgz) be the s-th upper k-record value, s > m. The conditional pdf of Y 


for the given vector parameter @ and that the first m k-record Ryx),--+ , Rmx) 
is given by 
ao gia AO) =o han) inte Bagley se) 


Hence, from equations (2.6) and (2.9) we get the Bayes predictive density function 


of Y 


h*(y|x) 


[ Plylx, 0) (6]x)d0 
(S) 


- ke-™C(M, N) 
— Tessa (2.10) 


J 8£06:2,5)[do(u) — Aolam)}*PEA9 (yer HPC II-M Oma, 
9 
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The Bayes point predictor of the s-th upper k-record based on the first m (m < s) 


observed k-records is given by 
A +00 
Yes = i yh" (y|x)dy 
= ks eae a N) oe fom (0: x, d)e N(0;2m,6) 


ee y[ro(y) — Ao(@in)]eAp(ye KO) m=—). dy dB 


2 ks- oe ee Mtb, dyer N(0;2m,9) 


= 
ie + delim) tea} do 
0 


2 CuM,N) f M(b:x,5)e-¥ (6:09) B{\=1(Z 4 dg (am))} dB, 
(°) 
(2.11) 


where Z ~T(s—m,k) and Xj ‘(2) is the inverse function of g(a). 


Example 2.4 (Continuation Examples 2.1-2.3) 


Using (2.11), we obtain the Bayesian point prediction of R,(,) for the following 


three models. We have 


i. One parameter Exponential model: 


lii. Pareto model: 


[b+ klog( Rina) /B)]"** 
I'(m + a) 
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+ sta-l  —[b+klog(R 
where I(Rim(k)) = So 7 o-I]R=re [+ log(Rm(xy/Plo doy, 
Remark. It may be noted that one may use (2.10) to obtain Bayesian prediction 
interval for R(x). 
In the rest of the paper, we consider two-parameter Exponential distribution 
which does not belong to the class C in (2.1), where the location parameter p is 


unknown. Its cdf and pdf are given by 
F(a;p,0) =1—- ez(@-K) g> bb, a >0, (2.12) 


and 


1 
eo) = He s@-) g> je oO 0; (2.13) 
o 


respectively, which is denoted by X ~ Exp(u,o). Ahmadi et al. [2] studied 
the problem of estimation and prediction in Exp(u,o) under LINEX (LINear- 


EXPonential) loss function based on k-records from Bayesian view point. 


3 Estimation in Exponential Model 


As mentioned in Section 1, the usual record data are rare in practical situations. 
In fact, the expected waiting time is infinite for every record after the first; but, 
this problem will be fixed by considering k-records instead (see Theorem 2.1 
of [15]). So, in this section, we shall be concerned with estimation of the two 
unknown parameters 4. and o of Exp(,o) based on k-record values. Suppose, 
we observed the first m upper k-records Riz) = 21, Rog) = 2, ++ Rm(k) = Lm 
from an Exp(u,o).Then from (2.3), (2.12) and (2.13) the likelihood function is 
given by 


k 
L(p,0|x) = (reretnnh), ESC TS awe Oe 0; (o:1) 


where x = (21, 22,...,2m)- 
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3.1 Maximum likelihood estimation 


In the case k = 1, the MLE (maximum likelihood estimation) of the two-parameters 
of the Exponential distribution can be found in Arnold et al. [5], p. 123. We 
obtained MLE based on k-record values, by (3.1). The natural logarithm of (3.1) 


is given by 
k 
l=mlnk—mlno ——(t4m—p), BS 21 <22<...< 0m. (3.2) 
o 


Assume that the parameters 4 and o are unknown, from (3.2) we readily obtain 


the MLE of u and oa as follows: 


fim = Rye), (3.3) 

and 
i R R 3.4 
M = ~(Rm(n) — Rice))- (3.4) 


It is easy to verify that 

© Ri) ~ Exp(u,o/k), 

e Rik) — Riz) and Rip) are independent random variables, 

e Rie) — Riz) has gamma distribution with parameters m — 1 and k/o. 
Then by (3.3) and (3.4) we have 

e Elim) = ut §, 

e MSE(jinr) = 25. 

Also, 

e E(éy) = ™, 

e MSE(ém) = = do not depend on k. 

e Cov(fim,om) = 0. 

Notice that fijy is a biased estimator pz, while an unbiased estimator for yu is given 
by 


m+k-1 k 
= Rick) 


m—1 Fem(h)- 
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3.2 Bayes estimation 


Our aim is to obtain Bayes estimate of the unknown parameters based on 7}, ..., 2m 
under SE loss function. We consider the following two cases for our Bayesian es- 
timation problem. 


a) a is known. 


Without loss of generality, we may assume o = 1 then by (3.1), we have 
f(x|u) = hem) cay <9 <... < am. (3.5) 


Assume the Jeffreys non-informative prior distribution (see [6]) of the parameter 
pin the form 


m(pt) oc 1. (3.6) 


Hence the posterior distribution of ju is 


m(plx) o f(x|H)r(H), 


where f(x|j) is the joint density function given by (3.5) and (yw) is the prior 
density given by (3.6). So, we have 


m(ukx) = ke") < ay. (3.7) 


Suppose an SE loss function, the Bayes estimate of a parameter is its posterior 


mean. Therefore, by (3.7), the Bayes estimate of the parameter ju is given by 


* 1 
fags = Ryn) — 7 (3.8) 


From Eq. (3.8) we get 


e E(fips) =p, 
e MSE(figs) = z@- 


b) o is unknown. 
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Under the assumption that both of the parameters 44 and o are unknown, we 
may consider the joint density as a product of the conditional density of ju for 
given o and a two parameter inverted gamma density for 0. So, we have 


(fo) agen (3.9) 


In fact o~' ~ P'(a, 3), which is the conjugate prior distribution of the parameter 
o for the fixed value of 1, and 71(u\7) x o~! which is the Jeffreys non-informative 
prior distribution (see [6]) of the parameter jy for fixed value of the parameter o. 
Thus, the joint posterior density is given by 

kG +k(@m—x1)I"F* 1 —3[6+k(em—H)] 


™(, o|X) = T(m re a) gintat2 © 


(3.10) 


Therefore, by (3.10) under SE loss function the Bayes estimate of the parameter 


a is given by 
B+ k(Rmk) — Ricey) 
mt+ta-1 ; 


6oBS = (3.11) 


Notice that, as > 0 and a— 1, Geps — Gy. By (3.11) we have 


e E(é2pg) = Stemeh 


° MSE(62p5) = eee: 


m-1 o2 | 
(m+a—1)? " (m+ta—1)?" 


Also, the Bayes estimate of the parameter pz is given by 


m+a 
k(m+a-—1) 


fioBs = Rmry + B [B+ k(Rmky — Racey)]- (3.12) 


By (3.12), we have 
e E(fizps) =w+ag+(l esas, 
¢ Var(fiors) = S[l+ att, 


m— o? 
e Cov(/i2Bs,02Bs) = woe 


Remark. It may be noted that one may use (3.7) and (3.10) to obtain Bayesian 


estimation interval for the parameters jz and o. 
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4 Prediction in Exponential Model 


In this section, we consider the problem of prediction, either point or inter- 
val, for future k-record values by Bayesian approach. Assume that we have 
the first m upper k-records Ry,) = £1, Rap) = 22,-.,Rm(e) = Lm from the 
Exp(p,0)—distribution. Based on such a sample, prediction, either point or in- 
terval, is needed for s-th upper k-record, 1 < m < s. We consider the following 


two cases: 


a) o is known. 


Without loss of generality, we may assume o = 1, then by (2.12), (2.13) and 
(2.9), we have 


* k s—m—1,—k(y—-2x 
F*(yla@m, B) = T(s—m) (y — Lm) ene a, Y>Xm, (4.1) 
which is independent of jz. So by (2.10) and (4.1) we have 
a1 
mtu) =f Fula) us 
keo™ s—m-—1_—k(y-2m) 
= ream — £m) € » Y>In, (4.2) 


for any posterior distribution (therefore, for any prior distribution) 7(u|x). By 
(4.2), we have 
Y — 2tm|x ~I(s —m,k). 


So, 


By (4.3) we have 
0 (Yi) =H+§, 
e MSE(Y%) = =. 
Ahmadi et al. [2] obtained the 100(1 — y)% Bayesian prediction interval for 


Re %), with equal tail as (Z1, Uj), where LZ; and U; are the lower and upper 
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bounds, respectively which are given by 
2 


4 
Ly = Rice) + op 
and 
X=? 
U, = Rin(k) + oh 


where ne stands for the y-th percentage of Chi-square distribution with 2(s —m) 


degrees of freedom. 
b) o is unknown 


Let Y = Rg) be the s-th upper k-record value, 1 < m < s. So, by (2.12) 
and (2.13), we have 


k = ie ee ae 
F*(ybes tia) = (Sym bao ram) (4.4) 


By (2.13), (3.10) and (4.4) Bayesian predictive density function of Y = Rep) , 


for the given past m records, is given by 


h(ylx) = a i PP (ulx, uo) (us, 0|x)dodp 
1 


k(@m — 21) + B m+a 
B(m+a,s—m)* k(y— 21) +6 
k(@m — 41) + 6 s—m 1 
k(y — 21) + B Y — Lm 
Now, by (4.5) the Bayes point predictor of the s-th upper k-record is given by 


x (1 US (4.5) 


% sta-l s—m B 
Yo ae | 
By (4.6) we have 
eB(y,) =the solo ee m) 
y a? m—1)(s—m a 
o MDE Na) = (srs gaan ake) tHeomGca =a) b 


In this case, Ahmadi et al. [2] also derived a Bayesian prediction interval for 
Rp) as follow: The 100(1 — y)% Bayesian prediction interval for R.(,) is given 
by 

(L2, U2), 
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where 
La = Ot Ey Dt Ray 
and 
i a Raw , ae 1) + Ri), 


where b, is the y-th percentage of Beta(m + a, s — m)-distribution. 


5 Conclusion 


In this paper, we have tackled the problems of estimation and prediction based on 
k-record data while the underlying distribution is assumed to have a general form. 
This family contains several life distribution such as Exponential, Weibull and 
Pareto and so on. A general form of conjugate prior was considered to obtain 
Bayesian estimation of unknown parameters and prediction of future k-record 
values. The proposed procedure was applied to the Exponential, Weibull and 
Pareto models in one parameter case. Moreover, we have developed the proposed 


procedure for two-parameter Exponential distribution in details. 
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